Tong Zhu

ALIGN: Aligned Delegation with Performance Guarantees for Multi-Agent LLM Reasoning

Jan 28, 2026
Via arXiv

Toward Efficient Agents: Memory, Tool learning, and Planning

Jan 20, 2026
Via arXiv

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Dec 30, 2025
Via arXiv

Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale

Dec 23, 2025
Via arXiv

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Nov 18, 2025
Via arXiv

LightAgent: Production-level Open-source Agentic AI Framework

Sep 11, 2025
Via arXiv

UAQFact: Evaluating Factual Knowledge Utilization of LLMs on Unanswerable Questions

May 29, 2025
Via arXiv

Effective climate policies for major emission reductions of ozone precursors: Global evidence from two decades

May 20, 2025
Via arXiv

Incentivizing Truthful Language Models via Peer Elicitation Games

May 19, 2025
Via arXiv

Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models

Mar 21, 2025
Via arXiv